Corpus: jpn_news_2005_300K


4.4.1.5 Number of Word-N-grams at Sentence Endings

Number of word N-grams (N = 1 to 5) at sentence endings, counted over the first K sentences

K          # of words   # of bigrams   # of trigrams   # of 4-grams   # of 5-grams
100                 1             43              75             91             97
1000                1            291             574            809            935
10000               4           1508            3776           6274           8268
100000              8           5753           20445          42774          65750
1000000            16           9784           42924         101969         168786
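
The counts in the table can be reproduced in spirit with a short sketch. It assumes that the figures are numbers of distinct N-grams (no value in the table exceeds K, which is consistent with that reading), that sentences are already tokenized into word lists, and that an ending N-gram is simply the last N tokens of a sentence. The function name count_ending_ngrams and the toy English sentences are hypothetical and are not taken from the corpus.

```python
from typing import Dict, List, Sequence, Set, Tuple


def count_ending_ngrams(
    sentences: Sequence[Sequence[str]], k: int, max_n: int = 5
) -> Dict[int, int]:
    """Count distinct sentence-final N-grams (N = 1..max_n) in the first k sentences."""
    seen: Dict[int, Set[Tuple[str, ...]]] = {n: set() for n in range(1, max_n + 1)}
    for tokens in sentences[:k]:
        for n in range(1, max_n + 1):
            # Sentences shorter than n tokens have no ending N-gram of that size.
            if len(tokens) >= n:
                seen[n].add(tuple(tokens[-n:]))
    return {n: len(grams) for n, grams in seen.items()}


# Toy, pre-tokenized input (an assumption; real data would come from the corpus).
toy_sentences: List[List[str]] = [
    ["the", "cat", "sat", "."],
    ["a", "dog", "sat", "."],
    ["it", "rained", "."],
]

print(count_ending_ngrams(toy_sentences, k=3))
# {1: 1, 2: 2, 3: 3, 4: 2, 5: 0}
```

As in the table, the unigram count stays very small when almost every sentence ends in the same punctuation token, while the counts for longer N-grams approach K.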


Zipf's diagram for sentence endings
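
The data behind a Zipf diagram is a list of (rank, frequency) pairs, usually plotted on log-log axes. The following is a minimal sketch of how such pairs could be derived for sentence-ending N-grams; it is not the procedure used to produce the original diagram. The function name zipf_points and the toy sentences are hypothetical, and the input is assumed to be pre-tokenized.

```python
from collections import Counter
from typing import List, Sequence, Tuple


def zipf_points(sentences: Sequence[Sequence[str]], n: int = 1) -> List[Tuple[int, int]]:
    """Return (rank, frequency) pairs for sentence-final N-grams, most frequent first."""
    counts = Counter(tuple(tokens[-n:]) for tokens in sentences if len(tokens) >= n)
    frequencies = sorted(counts.values(), reverse=True)
    # Rank 1 is the most frequent ending N-gram.
    return list(enumerate(frequencies, start=1))


# Toy, pre-tokenized input (an assumption, not corpus data).
example_sentences = [
    ["the", "cat", "sat", "."],
    ["a", "dog", "sat", "."],
    ["it", "rained", "."],
]

print(zipf_points(example_sentences, n=2))
# [(1, 2), (2, 1)]
```

The resulting pairs can be written to a two-column text file and plotted with logarithmic axes in any plotting tool.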



7009 msec needed at 2018-03-12 19:32